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HIGH AVAILABILITY SYSTEM AND METHOD FOR IMPROVED 

INITIALIZATION 

Field of the Invention 

The present invention relates generally to connmunication systems 
and, in particular, to the initialization of high availability systems. 

Background of the Invention 

The fixed network equipment (FNE) of today's wireless 
telecommunications systems is expected to provide services on the order 
of 99.999% of the time. Thus, the high availability systems that make up 
the FNE must be designed to handle the inevitable component failures 
with minimal impact to user services. One problem that can occur in high 
availability systems is the failure of device cards during power-up and 
initialization. Particularly, problematic are failures that lock up a common 
communication bus that serves many or all of the device cards in a 
system chassis. These failures effectively disable the entire system and 
are not remedied by re-initializing the system. Thus, the system is down 
until the device card can be manually replaced or at least removed, likely 
impacting user services for a significant period of time. Therefore, a need 
exists for a high availability system and method of initializing that address 
failures that lock up common communication buses in these systems. 
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Brief Description of the Drawings 

FIG. 1 is a block diagram depiction of a high availability system in 
accordance with a preferred embodiment of the present invention. 

FIG. 2 is a logic flow diagram of steps executed by a high 
availability system in accordance with a preferred embodiment of the 
present invention. 

Description of a Preferred Embodiment 

To address the need for a high availability system and method of 
initializing that address failures that lock up common communication 
buses in these systems, the present invention avoids powering-up 
peripheral components that have previously locked up the bus. It 
accomplishes this by storing indicators of successful initializations in 
memory, and then subsequently powering-up components only if such an 
indicator was stored for that component's last power-up. 

The present invention can be more fully understood with reference 
to FIGs. 1-2. FIG. 1 is a block diagram depiction of high availability 
system 100 in accordance with a preferred embodiment of the present 
invention. The preferred embodiment comprises controller 101 and 
peripheral devices 102-103, all of which are interconnected by bus 109. 
Although system 100 is preferably a Compact Peripheral Component 
Interconnect (cPCI) system (specifically, an HA 8216 system available 
from the Motorola Computer Group, a division of Motorola, Inc. of 
Schaumburg, Illinois), system 100 could instead be a VersaModule 
Eurocard BUS (VME BUS) system, for example, or any other bus system 
that allows power to be controlled for individual peripheral components. 
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Peripheral devices 102-103 may include devices such as 
telecommunications networking cards, host bus adapters, and video 
interface cards, for example. Also, although only two peripheral devices 
are depicted in FIG. 1. it is understood that the present invention is not 
5 limited to any particular number of peripheral devices in the system. The 
present invention is applicable to any number of peripheral devices that 
the system bus can support and for which power is individually controlled. 

Controller 101 preferably comprises non-volatile memory 105, 
watchdog timer 106, power applicator 108, and processor 107. Preferably. 
10 power applicator 108 comprises a Hot Swap Controller (HSC) that 
provides DC power to peripheral devices 102-103, and processor 107 



1^ comprises one or more microprocessors and memory devices. In the 

M preferred embodiment, under the control of software / firmware algorithms 

stored in the memory devices of controller 101, controller 101 performs 
15 those control tasks well-known to high availability, computer system 
ry operation and. additionally, the method described relative to FIG. 2. 

Operation of preferred high availability system 100, in accordance 
with the. present invention, occurs substantially as follows. When system 
100 initializes, as at power-up or upon a reboot, controller 101 applies 
20 power to peripheral devices 1 02-103 individually according to a predefined 
power-sequence. It will be assumed for purposes of illustration that 
peripheral 103 follows peripheral 102 in the power-up sequence. 

During initialization, then, processor 107 preferably instructs power 
applicator 108 to apply power to peripheral 102. However, prior to 
25 applying power to peripheral 102, processor 107 preferably clears the 
contents of a location in a memory array stored in non-volatile memory 
105 that corresponds to peripheral 102. Upon applying power and then 
receiving an indication that peripheral 102 has initialized, controller 101 
stores an identifier of the first peripheral. Preferably, this identifier is a 
30 value stored in the location in the memory array in non-volatile memory 
105 that corresponds to peripheral 102. Thus, since the location in the 
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memory array serves to identify peripheral 102, the value that is stored 
can be anything that denotes peripheral 102's successful initialization, just 
as "clearing the contents" is equivalent to storing any value that indicates 
that peripheral 102 has not yet successfully initialized. Peripheral 102 
5 having initialized, processor 107 preferably instructs power applicator 108 
to apply power to the next peripheral in the power-up sequence, 
peripheral 103. 

If, instead of successfully initializing, peripheral 102 locks up bus 
109, controller 101 restarts the power-up sequence, preferably by re- 
10 initializing system 100. Preferred controller 101 comprises a hardware 
p clock that serves as watchdog timer 106. Whenever bus 109 locks up, 

O watchdog timer 106 expires and triggers a re-initialization of system 100. 

H Prior to reapplying power to peripheral 102 (and thereby locking up 

IP the bus, re-initializing, and endlessly repeating this cycle), controller 101 

!^ 15 determines that an identifier of peripheral 102 was not stored. Controller 

flj 101 is able to make this determination because the contents of non- 

p volatile memory 105 survive re-initializations and power-downs of system 

O 100. Preferably, processor 107 reads the location in the memory array in 

non-volatile memory 105 that corresponds to peripheral 102 and 
20 determines from the value read that peripheral 102 did not successfully 
initialize during the last power-up. That is, the value read does not denote 
that peripheral 102 successfully initialized. 

Based on this determination, controller 101 skips peripheral 102 in 
the power-up sequence by not applying power to it. Thus, the present 
25 invention avoids the scenario in which peripheral 102 locks the bus during 
initialization, thereby disabling all of high availability system 100 until 
peripheral 102 can be serviced. For users who constantly rely on system 
100's availability, extended downtime such as this is at least unacceptable 
if not catastrophic. The present invention enables system 100 to re- 
30 initialize without peripheral 102 and otherwise provide the service that its 
users depend on it to provide. Thus, although peripheral 102 has failed 
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and will require servicing, the impact to system 100 users is minimized in 
view of the device failure. 

FIG. 2 is a logic flow diagram of steps executed by a high 
availability system in accordance with a preferred embodiment of the 
5 present invention. Logic flow 200 begins (202) when the system is 
powering up its peripheral components according to a power-up 
sequence. The logic flow ends (220) when the system completes (204) its 
power-up processing for each component in the power-up sequence. 

If components remain in the power-up sequence the next 
10 peripheral is processed. The system checks (206) this peripheral's 
Q memory array location, which is preferably stored in the system's non- 

y volatile random access memory. If the contents of this location indicate 

H that the component successfully initialized on its last power-up (i.e., a 
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particular marker is stored for this component), then the system clears 
- 15 (208) the contents of this component's memory array location and applies 

lU power (210) to the peripheral component. 

If the component initializes (212) without locking up the bus, then 
an identifier of the component is stored (214). Preferably, this identifier is 
a value stored in the location in the non-volatile memory that corresponds 
20 to the peripheral component. Thus, since the location in this memory array 
serves to identify the peripheral, the value that is stored can be anything 
that denotes the peripheral's successful initialization. The logic flow then 
returns to the beginning to process to next component in the power-up 
sequence. 

25 Instead, if the peripheral unit upon powering-up, locks up the bus, 

then the watchdog timer will expire and trigger a re-initialization (216) of 
the system. The logic flow returns to the beginning and the power-up 
sequence is restarted with the first component in the sequence. When the 
memory array location of the peripheral that locked up the bus is checked 
30 (206) for a marker indicating a successful initialization during its last 
power-up, the system will determine that none was stored. Thus, to avoid 
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locking up the bus again, the system skips (218) powering up the failed 
component. 

While the present invention has been particularly shown and 
described with reference to particular embodiments thereof, it will be 
5 understood by those skilled in the art that various changes in form and 
details may be made therein without departing from the spirit and scope of 
the present invention. 
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What is claimed is: 
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